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DISTRIBUTION OF EIGENVALUES OE REAL SYMMETRIC 
PALINDROMIC TOEPLITZ MATRICES AND CIRCULANT 

MATRICES 


ADAM MASSEY, STEVEN J. MILLER, AND JOHN SINSHEIMER 

Abstract. Consider the ensemble of real symmetric Toeplitz matrices, each 
independent entry an i.i.d. random variable chosen from a fixed probability 
distribution p of mean 0, variance 1, and finite higher moments. Previous inves¬ 
tigations showed that the limiting spectral measure (the density of normalized 
eigenvalues) converges weakly and almost surely, independent of p, to a distri¬ 
bution which is almost the standard Gaussian. The deviations from Gaussian 
behavior can be interpreted as arising from obstructions to solutions of Dio- 
phantine equations. We show that these obstructions vanish if instead one 
considers real symmetric palindromic Toeplitz matrices, matrices where the 
first row is a palindrome. A similar result was previously proved for a related 
circulant ensemble through an analysis of the explicit formulas for eigenvalues. 
By Cauchy’s interlacing property and the rank inequality, this ensemble has 
the same limiting spectral distribution as the palindromic Toeplitz matrices; 
a consequence of combining the two approaches is a version of the almost sure 
Central Limit Theorem. Thus our analysis of these Diophantine equations pro¬ 
vides an alternate technique for proving limiting spectral measures for certain 
ensembles of circulant matrices. 
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1. Introduction 

1.1. History. Random matrix theory has successfully modeled many complicated 
systems, ranging from energy levels of heavy nuclei in physics to zeros of L-functions 
in number theory. For example, while the nuclear structure of hydrogen is quite 
simple and amenable to description, the complicated interactions of the over 200 
protons and neutrons in a Uranium nucleus prevent us from solving the Hamil¬ 
tonian equation (let alone even writing down the entries of the matrix!). Similar 
to statistical mechanics, the complexity of the system actually helps us describe 
the general features of the solutions. Wigner’s great insight was to approximate 
the infinite dimensional Hamiltonian matrix with the limit of IV x iV real symmet¬ 
ric matrices chosen randomly (each independent entry is chosen from a Gaussian 
density; this ensemble of matrices is called the GOE ensemble). For each N one 
can calculate averages over the weighted set of matrices, such as the density of or 
spacings between normalized eigenvalues. Similar to the Gentral Limit Theorem, 
as N —>■ oo with probability one we have that the behavior of the normalized eigen¬ 
values of a generic, randomly chosen matrix agrees with the limits of the system 
averages. 

Instead of choosing the entries of our matrices from Gaussian densities, we could 
instead choose a nice probability distribution p, for example, a distribution with 
mean 0, variance 1 and finite higher moments. For real symmetric matrices with 
independent entries i.i.d.r.v. from suitably restricted probability distributions, the 
limiting distribution of the density of normalized eigenvalues is the semi-circle den¬ 
sity (see |Wig| IMeh| 1. While there is universality in behavior of the density of 
normalized eigenvalues, much less can be proved for the distribution of normalized 
spacings; though extensive numerical investigations support the conjecture that the 
behavior is the same as the GOE ensemble, this cannot be proved for general p. 

It is a fascinating question to impose additional structure on the real symmetric 
matrices, and see how the behavior changes. The GOE ensemble has N{N + l)/2 
independent parameters, the with i < j € {!,...,TV}. For sub-ensembles, 
different limiting distributions arise. For example, to any graph G one can associate 
its adjacency matrix Hg, where atj is the number of edges connecting vertices 
i and j. If G is a simple d-regular graph with no self-loops (there is at most 
one edge between two vertices, each vertex is connected to exactly d vertices, and 
there are no edges from a vertex to itself), its adjacency matrix is all O’s and I’s. 
Such graphs often arise in network theory. The eigenvalues of these adjacency 
matrices are related to important properties of the graphs: all eigenvalues lie in 
[—d, d], d is a simple eigenvalue if and only if the graph is connected, and if the 
graph is connected then the size of the second largest eigenvalue is related to how 
quickly information propagates in the network (see, for example, ESv]). Instead 
of choosing the matrix elements randomly, for each N there are only finitely many 
N X N d-regular graphs, and we choose uniformly from this set. While d-regular 
graphs are a subset of real symmetric matrices, they have different behavior. McKay 
[McK| proved the density of eigenvalues of d-regular graphs is given by Kesten’s 
Measure, not the semi-circle; however, as d oo the distributions converge to 
the semi-circle density. Interestingly, numerical simulations support the conjecture 
that the spacings between normalized eigenvalues are the same as the GOE; see for 
example | JMRR| . 

Thus by examining sub-ensembles, one has the exciting possibility of seeing new, 
universal distributions and behavior; for adjacency matrices of d-regular graphs, 
only dN/2 of the possible N{N — l)/2 edges are chosen, and the corresponding a^- 
(which equal 1) are the only non-zero entries of the adjacency matrices. Recently 
the density of eigenvalues of another thin subset of real symmetric matrices was 
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studied. Recall an iV x TV Toeplitz matrix An is of the form 


An 


f bo bi 

b-i bo 

b-2 &-1 


&2 

bi 

bo 


6jv_i ^ 

bN-2 

bN-3 


Ciij bj — 2. 


( 1 . 1 ) 


\ bi-N b2-N bo-N 


bo / 


Bai |Bai| proposed studying the density of eigenvalues of real symmetric Toeplitz 
matrices with independent entries independently drawn from a nice distribution 
p. As a Toeplitz matrix has N degrees of freedom (the bi’s), this is a very thin 
sub-ensemble of all real symmetric matrices, and the imposed structure leads to 
new behavior. 

Initial numerical simulations suggested that the density of normalized eigenval¬ 
ues might converge to the standard Gaussian density; however, Bose-Chatterjee- 
Gangopadhyay jBGGj showed this is not the case by calculating the fourth moment 
of the limiting spectral measure (see Definitions II .11 and 11 .dll of the normalized 
eigenvalues. The fourth moment is 2|, close to but not equal to the standard 
Gaussian density’s fourth moment of 3. Bryc-Dembo-Jiang IBml (calculating the 
moments using uniform variables and interpreting the results as volumes of solids 
related to Eulerian numbers) and Hammond-Miller |hM| (calculating the moments 
by solving systems of Diophantine equations with obstructions) then independently 
found somewhat intractable formulas for all the moments, and further quantified 
the non-Gaussian behavior. The analysis in IHHl shows that the moments of the 
Toeplitz ensemble grow fast enough to give a distribution with unbounded support, 
but significantly slower than the standard Gaussian’s moments (the ratio of the 
Toeplitz moment to the standard Gaussian’s moment tends to zero as fc —> oo). 

In |HM| it was observed that their techniques may be applicable to a related 
ensemble. Specifically, by imposing an additional symmetry on the matrices by 
requiring that the first row be a palindrome (see C3), the obstructions to the 
Diophantine equations vanish and the limiting spectral measure converges weakly, 
in probability and almost surely to the standard Gaussian (see a for the exact 
statements). Bose and Mitra Hm] proved weak convergence for an ensemble closely 
related to our palindromic Toeplitz matrices (see 14.411 for the ensemble they stud¬ 
ied). They combined explicit expressions for the eigenvalues of circulant matrices 
and probabilistic arguments to construct the empirical spectral distribution; with 
these in place, they then show the limiting spectral distribution is the standard 
Gaussian. 

We show in Theorem 1331 that our analysis gives an alternate proof of Bose and 
Mitra’s result. We generalize the linear algebra arguments described in |HM| to 
analyze the Diophantine equations that arise. The eigenvalues of our palindromic 
Toeplitz ensemble are interlaced with those of the circulant ensemble of (14.411 . By 
Gauchy’s interlacing property ITemma, 14.111 and the rank inequality (Lemma ^31 j 
our analysis of the Diophantine equations related to the palindromic Toeplitz en¬ 
semble provides an alternate proof of the limiting spectral measure of the circulant 
ensemble in (lOl . This equivalence may be of use to other researchers studying 
related ensembles, as we have replaced having to calculate and work with explicit 
formulas for eigenvalues to solving a system of Diophantine equations without ob¬ 
structions. Additionally, this equivalence leads to a version of the almost sure 
Gentral Limit Theorem (see Theorem 11 .fill . 


1.2. Notation. We briefly review the notions of convergence examined in this pa¬ 
per (see m for more details) and define the quantities studied. We consider real 
symmetric palindromic Toeplitz matrices whose independent entries are i.i.d. ran¬ 
dom variables chosen from some distribution p with mean 0, variance 1, and finite 
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higher moments. For convenience we always assume N is even. Thus our matrices 
are of the form 


An = 


( bo 

bi 

b 2 

... b 2 

bi 

bo 

\ 

bi 

bo 

bi 

bo 

b 2 

bi 


b 2 

bi 

bo 

••• 64 

bo 

b 2 



bo 

64 

... bo 

bi 

b 2 


bi 

&2 

bo 

... 61 

bo 

bi 


\ bo 

bi 

b 2 

&2 

bi 

bo 

/ 


( 1 . 2 ) 


Each NxN matrix An is parametrized by N/2 numbers: bo{AN), ■ ■ ■, &Ar/ 2 -i(^Af)- 
We may thus identify such NxN real symmetric palindromic Toeplitz matrices 
with vectors in 

For each integer N let ^In denote the set of TV x TV real symmetric palindromic 
Toeplitz matrices. We construct a probability space (Oat, Pat) by setting 


IPa G f^A : biN^AN) € [ckj, /?i] for i G {0,..., TV/2 — 1}}) 

“ rPi 

= / p{xi)dxi, 


(1.3) 


where each dxi is Lebesgue measure. To each An € flv we attach a spacing measure 
by placing a point mass of size 1/TV at each normalized eigenvalue^ Ai(AAr): 


r ^ X ( K{An)\ j 

fj.A!^{x)dx = ■^2^^o\^x - -j^jdx, 


(1.4) 


where 5(x) is the standard Dirac delta function. We call pan the normalized 
spectral measure associated to An- 

Definition 1.1 (Normalized empirical spectral distribution). Let An be an N x N 
real symmetric matrix with eigenvalues Xn > ■ • ■ > Ai. The normalized empirical 
spectral distribution (the empirical distribution of normalized eigenvalues) P^n/Vn 
is defined by 

#{i < TV : XJVN < x} 


F^^/A^{x) = 


TV 


(1.5) 


As ^ ^fiAN{t)dt, we see that F^^i/Vn -g |.]^g cumulative distri¬ 

bution function associated to the measure pan ■ 

We are interested in the behavior of a typical F^’^/AN as TV ^ oo. Our main 
results are that converges to the cumulative distribution function of the 

Gaussian (we describe the type of convergence in D- Thus let Mm equal the 
^th jaoment of the standard Gaussian (so M 2 k = (2fc — 1)!! and M 2 k+i = 0). As 
there is a one-to-one correspondence between TV x TV real symmetric palindromic 
Toeplitz matrices and we may study the more convenient infinite sequences. 

Thus our outcome space is On = {6o, • • ■ }, and if w = (wq, wi,...) € On then 

rd. 

Prob(a;i G [ai,/?i]) = / p{xi)dxi. (1.6) 

J OLi 

We denote elements of by A to emphasize the correspondence with matrices, 
and we set A/v to be the real symmetric palindromic Toeplitz matrix obtained by 


^From the eigenvalue trace lemma (Trace(A^) = A?(Aiv)) and the Central Limit Theorem, 
we see that the eigenvalues of Ajn are of order vW. This is because Trace(A^) = 
and since each aij is drawn from a mean 0, variance 1 distribution, Trace(A^) is of size . This 
suggests the appropriate scale for normalizing the eigenvalues is to divide each by x/N. 
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truncating A = (bo, bi,...) to (bo, ■. ■, bN/ 2 -i)- We denote the probability space by 
(nN,iFN,PN). 

To each integer m > 0 we define the random variable Xm;N on by 

/ OO 

x^dF^«/'^(x)-, (1.7) 

-OO 

note this is the moment of the measure ■ 

We investigate several types of convergence. 

( 1 ) (Almost sure convergence) For each m, Xm-M Xm almost surely if 

Pn ({A G : Wm;Ar(A) ^ Am(A) as IV ^ oo}) = 1; (1.8) 

(2) (In probability) For each m, X^-^n in probability if for all e > 0, 

lim PN(|X„;Ar(A) - W„(A)| > e) = 0; (1.9) 

iV—»-oo 

(3) (Weak convergence) For each m, Xm-N —*■ Xm weakly if 

¥n(XmMA) <x) ^ P(A„(A) < x) ( 1 . 10 ) 

as N ^ OO for all x at which Fx^(x) = F(Xm(A) < x) is continuous. 

Alternate notations are to say with probability 1 for almost sure convergence and 
in distribution for weak convergence; both almost sure convergence and convergence 
in probability imply weak convergence. For our purposes we take Xm as the random 
variable which is identically Mm (thus Xm(A) = Mm for all A G 11^). 

Our main tool to understand the is the Moment Convergence Theorem 

(see m for example); our analysis is greatly simplified by the fact that we have 
convergence to the standard normal. 

Theorem 1.2 (Moment Convergence Theorem). Let {F 7 v(a:)} be a sequence of 
distribution functions such that the moments 

/ OO 

x'^dFM(x) (1.11) 

-OO 

exist for all m. Let $ be the distribution function of the standard normal (whose 
moment is Mm)- ^/limiv^oo Mm,N = Mm then limjv^oo Fn(x) = <i>(a;). 

Definition 1.3 (Limiting spectral distribution). If as N ^ oo we have 
converges in some sense (for example, weakly or almost surely) to a distribution F, 
then we say F is the limiting spectral distribution of the ensemble. 

In m.3l we state our main results about the type of convergence of the . 

The limiting spectral distribution will be the distribution function of the standard 
normal. The analysis proceeds by examining the convergence of the moments. For 
example, assume for each m that we have Xm,N(A) Mm almost surely. If 

Bm = {A G flpi ■ Xm;N(A) 7 ^ Mm SS N —> Oo}, (1.12) 


then F(Bm) = 0 and thus 



(1.13) 


This and the Moment Convergence Theorem allow us to conclude that with prob¬ 
ability 1, F^«/'^(x) converges to <l>(a:). 
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1.3. Main Results. By analyzing the moments of the /iAjv (for d^n N x N real 
symmetric palindromic Toeplitz matrix), we obtain results on the convergence of 
pAfi/^/N distribution function of the standard normal. The moment of 

liAnix) is 

pOO 1 ^ 

Mm{AN) = / x'^HAM{x)dx = '^K'iAN)- (1.14) 

Definition 1.4. Let MmiN) be the average of over the ensemble, with 

each A weighted by its distribution. Set Mm = limjv^oo Mm(fV). ITe call Mm{N) 
the average moment, and Mm the limit of the average moment. 

While we have two different definitions of Mm (we have defined it as both the 
m^h jjioment of the standard Gaussian as well as the limit of Mm{N)), in Theorem 
lO we prove that the Mm{N) converge to the moments of the standard Gaussian 
density, independent of p. Thus the two definitions are the same. Specifically, 
lim^v^oo AIm{N) = {2k — 1)!! if m = 2fc is even, and 0 otherwise. Once we show 
this, then the same techniques used in |hM| allow us to conclude 

Theorem 1.5. The limiting spectral distribution of real symmetric palindromic 
Toeplitz matrices whose independent entries are independently chosen from a prob¬ 
ability distribution p with mean 0, variance 1 and finite higher moments, converges 
weakly, in probability and almost surely to the cumulative distribution function of 
the standard Gaussian, independent of p. 

We sketch the proof, which relies on Markov’s method of moments. While this 
technique has been replaced by other methods (which do not have as stringent re¬ 
quirements on the underlying distribution), the method of moments is well suited to 
random matrix theory problems, as well as many questions in probabilistic number 
theory (see Eni). 

By the eigenvalue trace lemma, 

N 

= TraceiAf^) = (1-15) 

2—1 l<Zi,...,2m,<A/' 

Applying this to our palindromic Toeplitz matrices, we have 

Mim{Af) = K[Mm{Af,[)] = ^ ^ ^ ^ 

l<2l ,.. .,2^ < 

( 1 - 16 ) 

where by E(- • •) we mean averaging over the N x N palindromic Toeplitz ensemble 
with each matrix A^i weighted by its probability of occurring; thus the bj are 
i.i.d.r.v. drawn from p. We show in ^that the Mm = hm^v^oo-Vn(fV) are the 
moments of the standard Gaussian density. The odd moment limits are easily shown 
to vanish, and the additional symmetry (the palindromic condition) completely 
removes the obstructions to the system of Diophantine equations studied in |HM| . 
Gonvergence in probability follows from 

hm {E[Mm{ANf]-E[MmiAN)]'^) = 0, (1.17) 

N—*oo 

Chebyshev’s inequality and the Moment Gonvergence Theorem, while almost sure 
convergence follows from showing 

lim E[\Mm{AN)-E[Mm{AN)]\^] = Om{^, (1.18) 

N^oo 

and then applying Chebyshev’s inequality, the Borel-Cantelli Lemma and the Mo¬ 
ment Convergence Theorem. Analogues of these estimates are proven in |HM| for 
the ensemble of real symmetric Toeplitz matrices by degrees of freedom arguments 
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concerning the tuples (ji, ... ,im)- The palindromic structure does not change the 
number degrees of freedom, merely the contribution from each case. Thus the ar¬ 
guments from [HM) are applicable, and yield both types of convergence. We sketch 
these arguments in isi In a we investigate related ensembles. In particular, we 
show our techniques apply to real symmetric palindromic Hankel matrices, with 
Theorem 11.51 holding for this ensemble as well. Further, we show that the limit¬ 
ing spectral distribution of the palindromic Toeplitz ensemble is the same as that 
of Bose and Mitra’s symmetric Toeplitz ensemble, implying that our Diophantine 
analysis is equivalent to their analysis of the explicit formulas for the eigenvalues 
of their ensemble. 

One particularly nice application of the correspondence between these two en¬ 
sembles is that we obtain a version of the almost sure Central Limit Theorem for 
certain weighted sums of independent random variables. Specifically, in ^we show 


Theorem 1.6. Let Xi, X 2 ,... be independent, identieally distributed random vari¬ 
ables from a distribution p with mean 0, variance 1, and finite higher moments. For 
oj = {xi,X2, ■ ■ ■) setXfiuj) =X£, and consider the probability spaee (n,ir, P) (where 
P is induced from Prob(X^(a;) < x) = J((^p{t)dt). Let 




Then 


Lo G LI : sup 

I xGM 




\/W2 


0 as n ^ 00 


= 1 ; 


(1.19) 


( 1 . 20 ) 


here L denotes the indicator function and <1> is the distribution function of the stan¬ 
dard normal: 


$(x) = 


1 


"/^dt. 


( 1 . 21 ) 


yf 277 J—cc 

We conclude in (IHlby investigating the spacings between normalized eigenvalues 
of palindromic Toeplitz matrices. 


2. Calculating the Moments 

Many of the calculations below are similar to ones in |hM| . the difference being 
that the additional symmetries imposed by the palindromic condition remove the 
obstructions to the Diophantine equations. Our main result, needed for the proof 
of Theorem n~5i is that 

Theorem 2.1. For the ensemble of real symmetrie palindromie Toeplitz matrices 
with independent entries chosen independently from a probability distribution p with 
mean 0, variance 1 and finite higher moments, each (the limit of the average 
moments of the normalized empirical spectral measures) equals the moment of 
the standard Gaussian density. Specifically, M 2 k+i = 0 and M 2 k = (2fc—1)!!, where 
{2k - 1)!! = (2fc - 1) • (2A: - 3) • • • 3 • 1. 

We prove Theorem 12.II in stages. In >12.II we show that the odd moments vanish, 
and that the limit of the average zeroth and second moments are 1. Determining 
the moments is equivalent to counting the number of solutions to a system of 
Diophantine equations. In >12.21 we prove some properties of the Diophantine system 
of equations, which we then use in a to show that M4, the limit of the average 
fourth moment as N —> 00 , equals that of the standard Gaussian density. As we 
can always translate and rescale a probability distribution with finite moments to 
have mean 0 and variance 1, the first moment that shows the shape of an even 
distribution is the fourth. This supports the claim that the palindromic condition 
removes the obstructions. We then use linear algebra techniques (and the ability 
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to solve several Diophantine equations at once) to show that the limits of all the 
even average moments agree with those of the standard Gaussian density in El 
We introduce some notation. Let A]\[ he an N x N real symmetric palindromic 
Toeplitz matrix. We write for the entry in the i*** row and column. We 
determine which entries are forced to have the same value as is a 

real symmetric palindromic Toeplitz matrix, if is forced to have the same 

value then either (1) it is on the same diagonal; (2) it is on the diagonal obtained 
by reflecting the diagonal is on about the main diagonal; (3) it is on the 

diagonal corresponding to ; (4) it is on the diagonal obtained by 

reflecting about the main diagonal the diagonal corresponding to . 

In other words, 


+ l + I i^ S I ■ - 

Pm +1 I 

where we set zat+i equal to ii. Equivalently, 


1 


l^n +1 '^n| 

^ 1 l^n+1 '^n|j 


{ ^m +1 

^m +1 


i('^n+l ’^n) 

±(i„+i - in) + {N -1) 
±{in+l - in) - (N - 1). 


( 2 . 1 ) 


( 2 . 2 ) 


We denote the common value by and use ba to refer to a generic diagonal 

(thus the Qij’s refer to individual entries and the ba’s refer to diagonals). Each 
such matrix is determined by choosing ^ numbers independently from p, the b^ 
with aG{0,...,Y~l}- The moments are determined by analyzing the expansion 
for MkiN) in (tTTHIl . We let pk denote the fc*** moment of p, which is finite by 
assumption. 

We often use big-Oh notation: if g{x) is a non-negative function then f{x) = 
0{g{x)) (equivalently, f{x) <C g{x)) if there are constants Xo,C > 0 such that for 
all X > xq, \fix)\ < Cg(x). If the constant depends on a parameter m we often 
write <Cm or Om- 


2.1. Zeroth, Second and Odd Moments. 

Lemma 2.2. Assume p has mean zero, variance one and finite higher moments. 
Then Mo = 1 and M 2 = 1. 

Proof. For all N, Mq^A^) = Mo{N) = 1. For the second moment, we have 
M2{N) = ^ ^ lE(aqi2 ’ ^* 2 * 1 ) 

= = VI E E(6f,,_,,|). (2.3) 

l<2l,22<iV 

As we have drawn the b^s from a variance 1 distribution, the expected value above 
is 1. Thus M 2 {N) = ^ = 1, so M 2 = limAr^oo M 2 {N) = 1 also. □ 

Note there are two degrees of freedom. We can choose to be on any diagonal. 
Once we have specified the diagonal, we can then choose ii freely, which now 
determines ^ 2 - 

Lemma 2.3. Assume p has mean zero, variance one and finite higher moments. 
Then M 2 k+i = 0. 

Proof. For m = 2k + 1 odd, in itTT^ at least one ba occurs to an odd power. If 
a ba occurs to the first power, as the expected value of a product of independent 
variables is the product of the expected values, these terms contribute zero. Thus 
the only contribution to an odd moment come when each ba in the expansion occurs 
at least twice, and at least one occurs three times. 
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There are at most fc + 1 degrees of freedom. There are at most k values of ba 
to specify, and then once any index ie is specified in OBI) . there are at most 8 
values (coming from the four possible diagonals in (12.21) 1 for each remaining index. 
Therefore of the tuples {ii,..., i2fe+i), there are only 0(iV^+^) tuples where 

the corresponding b^’s are matched in at least pairs. 

Consider such a tuple. Assume there are r < k different b^, say b^^,... ,ba^, 
with baj occurring rij > 2 times (and further at least one rij > 3). Such an 
(A, • • ■, i 2 k+i) tuple contributes 0^=1 E [hii] = n;=i to M 2 k+i{N), where pj 

is the j**' moment of p and hence finite. Thus this term contributes Ofc(l) (where 
the constant depends on fc); in fact the constant is at most maxj<2fc+i(|Pj |^, !)• 
Thus 

M2k+i{N) «, N-K (2.4) 

iV 2 + 

so M^k+i = limAT^oo Af2fc+i(fV) = 0, completing the proof. □ 

2.2. Higher Moments. We expand on the method of proof of Lemma 12.31 to 
determine the even moments. We must find the N oo limit of 

Af2fc(A^) = jyk+l ^ ^ E(flni2®i2j3 ' ' ' )• (2-5) 

l<n ,...,i 2 fc<A' 

If the tuple (*1 ,... ,i 2 k) has r different b^, say ba^,... ,ba^, with ba^ occurring rij 
times, then the tuple contributes 

Lemma 2.4. The tuples in (1^ where some Uj ^ 2 contribute Ofc(^) to M 2 k{N), 
the average 2fc*^ moment. Thus, as N ^ oo, the only tuples that contribute to M 2 k 
are those where the are matched in pairs. 

Proof. If an Uj = 1 then the corresponding 6 q,^. occurs to the first power. Its 
expected value is zero, and thus there is no contribution from such tuples. Thus 
each Uj > 2, and the same argument as in Lemma o shows that each tuple’s 
contribution is 0^(1). If an Uj > 3 then the corresponding baj occurs to the third 
or higher power, and there are less than fc +1 degrees of freedom (there are Ok{N^) 
tuples where each Uj > 2 and at least one Uj >3). As each tuples’ contribution 
is Ofc(l)i and we divide by in lITBl) . then the total contribution from these 

tuples to M 2 k{N) will be Ofe(^). So in the limit as iV ^ oo the contribution to 
M 2 k from tuples with at least one Uj > 3 is 0. □ 

Remark 2.5. Therefore the ba ’s must be matched in pairs. There are fc + I 
degrees of freedom (we must specify values ofba^, •.. ,baf,, and then one index if,). 
It is often convenient to switch viewpoints from having these k pairings and one 
chosen index to having fc+1 free indices to choose, and we do so frequently. Another 
interpretation of Lemma \2.4\ is that of the tuples, only have a chance 

of giving a non-zero contribution to M 2 k{N)- As any tuple contributes at most 
Ofc(l) to M 2 k{N), in the arguments below we constantly use degree of freedom 
arguments to show certain sets of tuples do not contribute as N ^ oo (specifically, 
any set of tuples of size Ok{N^) contributes Ok{^) to M 2 k{N)). 

From E2), if is paired with then one of the following holds: 

'^m+l '^n) 

'^m+1 '^n) (-^ 1) 

^m+1 '^m — '^n) !)• (^■^) 

These equations can be written more concisely. There is a choice of Ci G {0, 
±{N — 1)} (£ is a function of the four indices) such that 

^m+l '^m — '^n) T 


(2.7) 
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The following lemma greatly prunes the number of possible matchings. 

Lemma 2.6. Consider all tuples {ii,... ,i 2 k) such that the corresponding ’s are 
matched in pairs. The tuples with some paired with some by a plus 

sign in fTTh contribute Ok{^) to M 2 k{N)- Thus, as N ^ co, they contribute 0 to 
M2k- 

Proof. Each tuple (ii,... ,i 2 k) contributes 

lE[aii*2 ■ ■ • ~ ®'[^K2-*ir ■ ■ ^bi-i2fc|] (2-8) 

to M 2 k{N), and the only contributions we need consider are when the are 

matched in pairs. There are k equations of the form iTzIl : each equation has a 
choice of sign (which we denote by ei,..., Cfe) and a constant (which we denote by 
Cl,..., Cfe; note each Ci is restricted to being one of three values). We let a;i,..., Xfc 
be the values of the \im+i — im\ on the left hand side of these k equations. Dehne 
xi = h — ii, X 2 = is — 12 , ■ ■ ■, X 2 k = ii — *2fc- We have 


*2 

= *1 

- Xi 

is 

= ii 

-X 1 -X 2 

h 

= h 

Xi * * * X2k 


By the hnal relation for zi, we find 

Xi + ■ ■ ■ + X2k = 0. 


(2.9) 

( 2 . 10 ) 


Let us say that is paired with Then we have relations between 

the indices because they must satisfy one of the k relations; let us assume they 
satisfy the equation. Further, by definition there is an 77^ = ±1 such that 

im+i ~lm = this is simply because we have defined the x^’s to be the absolute 
values of the im+i — *m on the left hand sides of the k equations. We therefore have 
that 

im+l Im — ^i{in+l In) C^, (2.11) 

or equivalently that 

Xm = r]iX( = eiXn + Ci. ( 2 . 12 ) 

Since = 1, we have that 

Xn = ilteixt - etCi. (2.13) 

Therefore each xi is associated to two x’s, and occurs exactly twice, once through 
Xm = ViXi and once through Xn = rueixi — eiCi. Substituting for the x’s in (lOHl 
yields 

2k k 

'^Xm = + ee)xe - eeCi) = 0. (2.14) 

m—1 i—1 

If any eg = 1, then the xg are not linearly independent, and we have fewer than 
k + 1 degrees of freedom. There will be at most Ok{N^) such tuples, each of which 
contributes at most 0^(1) to M 2 k{N). Thus the terms where at least one = 1 
contribute Ok {^) to M 2 k{N), and are thus negligible in the limit. Therefore the 
only valid assignment that can contribute as iV —> oo is to have all = — 1 (that 
is, only negative signs in E3). □ 


Remark 2.7. The main term is when each eg = —1. In this case, immedi¬ 

ately implies that the Cg’s must sum to zero. This observation will be essential in 
analyzing the even moments. 
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Figure 1. Possible Configurations for the Fourth Moment Matchings 


2.3. The Fourth Moment. We calculate the fourth moment in detail, as the 
calculation shows how the palindromic structure removes the obstructions to the 
Diophantine equations encountered in pfM| . This will establish the techniques that 
we use to solve the general even moment in an 

Lemma 2.8. Assume p has mean zero, variance one and finite higher moments. 
Then M 4 = 3, which is also the fourth moment of the standard Gaussian density. 

Proof. From srm . the proof follows by showing 

M 4 = lim EiaijOjkOkiaii) (2-15) 

N^oo I\ 

equals 3. From Lemma 12.41 the must be matched in pairs. There are three 

possibilities (see Figure P) for matching the in pairs: 

• {i,j) and (j, fc) satisfy (I2.7|l . and (fc, Z) and {l,i) satisfy (12.711 : 

• {i,j) and {k,l) satisfy (I2.7|l . and {j,k) and {l,i) satisfy (12.711 : 

• {i,j) and {l,i) satisfy 12.711 . and {j,k) and {k,l) satisfy (12.711 . 

By symmetry (write aijOjkOkiau as auaijajkaki), the third case has the same 
contribution as the first. These two cases are examples of adjacent matchings. In 
the tuple {i,j,k,l) we have four pairs, (i,j),{j,k),(k,l) and {l,i), and we match 
the two adjacent ones. Note that while in each case it is possible for both pairs to 
be associated to the same ba {a G {0,..., ^ — 1 }), such tuples give a lower order 
contribution. We can therefore ignore the contribution when both pairs have the 
same value, as this is a correction of size 0{i^) to M 4 . Also, by Lemma f2.t)l we 
only have minus signs in m- 

Case One: Adjacent Matching. Consider the adjacent matching (which occurs 
twice by relabeling). We thus have the following pair of equations: 

j — i = —{k — j) + Ci, l — k = —{i — l) + C 2 (2-16) 

Rewriting these equations, we find that 

k = i + Ci and k = i + C 2 , (2-17) 

with Cl, C2 G {0, ±{N — 1)} and i,j, k,l G {1,..., N}. 

We divide by in H2.15|l . While we have N'^ tuples {i,j, k, 1), only the O(iV^) 
which have the matched in pairs contribute. In fact, any set of tuples of 

size O(A^) will not contribute in the limit. Thus we may assume Ci and C2 equal 
zero. For example, if Ci = iV — 1 then i is forced to equal 1, which forces k to equal 
N. Letting j and I range over all possible values still gives only such tuples. 
Similar arguments handle the case of Ci = —{N — 1). 

Thus Cl = C2 = 0; there are N choices for k G {1,..., A}, and then i is de¬ 
termined. We have N choices for j G {!,..., N} and N — 0(1) choices for I (we 
want the two pairs to correspond to different 6^, so we must choose I so that Oki 
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is not on an equivalent diagonal to a^). There are — 0{N'^) such tuples, each 
contributing 1 (the second moments of p equal 1, and we divide by N^). Thus each 
adjacent pairing case contributes 1 + 0{^) to M 4 {N). As there are two adjacent 
matching cases, as N ^ oo these contribute 2 to M4. 

Case Two: Non-adjacent Matchings. The equations for the non-adjacent case 
gives the following pair of equations: 

j — i = —{l — k) + Ci k — j = —{i — l) + C 2 , (2-18) 

or equivalently 

j = i + k — I + Cl = i + k — I — C 2 - (2-19) 

We see that Ci = —C 2 , or Ci + C2 = 0. 

In |HM) . as N ^ 00 this non-adjacent pairing contributed | to M4, and was 
responsible for the non-Gaussian behavior. The difference is that in |HM| we had 
the relation j = i + k — I without the additional factor Ci S {0, ±(7V — 1)}. The 
problem was that we required each i,j,k,l € {1 ,...,A}; however, if we choose 
i, k and I freely then j may not be in the required range. For example, whenever 
i,k> and I < then j > N; thus for the Toeplitz ensemble at least of 

the tuples that “should have” contributed 1 instead contributed 0. 

We now show this does not happen for the palindromic Toeplitz ensemble. 
For any i,k,l G A^} there is a choice of Ci G {0,±(fV — 1)} such that 

j G N} as well. The choice of Ci is unique unless i + k — I G {1, N}, but 

this is an additional restriction (i.e., we lose a degree of freedom because an ad¬ 
ditional equation must be satisfied) and there are only 0 {N^) triples {i,k,l) with 
i + k — I G {1, N}. Thus there are again + O(iV^) tuples, each with a contribu¬ 
tion of 1 (if all four are on equivalent diagonals then this is again a lower 

order term, as there are at most O(iV^) such tuples). As there is one non-adjacent 
matching case, as ^ 00 this contribute 1 to M 4 . 

Adding the contribution from the two cases gives a value of 3 for M4, the limit 
of the average fourth moment, completing the proof. □ 

2.4. The General Even Moment. We now address the general case. Using the 
linear algebra techniques highlighted in the fourth moment calculation, we complete 
the proof of Theorem EH by showing the limit of the even average moments, the 
M2fe’s, agree with the even moments of the standard Gaussian density. 

Fix an even number 2k > 6. By Lemma El the must be matched in 

pairs. Each pair satisfies an equation like E3, and by Lemma 12.HI the negative 
sign must hold. There are {2k — 1)!! ways to match^ the 2k objects in pairs. The 
proof of Theorem l2.1l is completed by showing that each of the {2k — 1)!! matchings 
contributes 1 to M2fe, as this then implies that M 2 k = (2fc — 1)!!. 

Consider any matching of the 2k pairs of indices (zi, 12), (*2, *3), • ■ ■, (z2fc, zi) into 
k pairs. We obtain a system of k equations. Each equation is of a similar form; for 
definiteness we describe the equation when (zm,Zm-i-i) is paired with (z„,z„+i): 

Zm-t-l im — (Zn-l-1 in) Gj, (2.20) 


^There are {^^{ ways to choose the first two objects to be paired, {^^ 2 ^) to choose the 

second two objects to be paired, and so on. As order does not matter, there are fc! ways to arrange 
the k pairs. Thus the number of matchings is 






REAL SYMMETRIC PALINDROMIC TOEPLITZ MATRICES AND CIRCULANT MATRICES 13 


where as always i 2 k+i = *i, each index is in {1,... ,N} and Cj G {0,±(A^ — 1)}. 
We may re-write as 

Cj = im+l '^n+1 (2.21) 

Note that if we write the k equations in the form given by then each index 

ia occurs exactly twice. It occurs once with a coefficient of -1-1 and once with a 
coefficient of —1. This is because the index ia occurs in exactly two pairs of indices, 
in (where it has a -fl) and in {ia,ia+i) (where it has a —1). 

It is useful to switch between these two viewpoints and IE2IJ), and we do 

so below. We have k equations and 2k indices. We show there are fc -|- 1 degrees of 
freedom. In fact, more is true. In the results that follow, we show fc-f 1 of the indices 
can be chosen freely in TV}, and for each choice, there is a choice of the Cj’s 

such that there are values for the remaining k — 1 indices in TV}, and all k 

equations hold. This means that each of the TV^+^ tuples (coming from choosing 
fc -I- 1 of the indices freely) contributes 1, which shows this matching contributes 1 
to M 2 k- 

We first show how to determine which fc -|- 1 of the 2k indices we should take as 
our free indices. Determining a good, general procedure for finding the right free 
indices for an arbitrary choice of the (2k — 1)!! matchings was the hardest step in 
the proof. 

Lemma 2.9. Consider the system of k equations above, where each is of the form 
described in We may number the equations from 1 to k and choose fc -|- 1 

indices to be our free indices in such a way that only the last equation has no 
dependent indices occurring for the first time. For the first k — 1 equations, there is 
always a dependent index occurring for the first time, and there is always a choice 
of the Cj’s so that the dependent indices in the first k — 1 equations take on values 
in {1,...,TV}. 

It is important that in each equation only one dependent index occurs for the 
first time. The reason is that we are trying to show TV^+^ -|- Ok(N^) of the TV^+^ 
choices of the independent indices lead to valid configurations. If there were an 
equation with dependent indices whose values were already determined, then we 
would have restrictions on the independent indices and a loss of degrees of freedom. 
We shall handle the last equation later (as clearly every index occurring in the last 
equation has occurred in an earlier equation). 

Proof. Choose any of the k equations. We shall refer to it as eq(fc). This equation 
contains exactly four indices. As this is the last equation, each index must have 
appeared in an earlier equation. Thus, eq(fc) marks the second time we have seen 
each of these four indices. 

Choose any of the four indices, and select the equation in which this index first 
appeared. There is only one such equation, as each index occurs in exactly two 
equations. We label this equation eq(fc — I), and we let the index which we have 
just chosen be one of our k — 1 dependent indices. For the other three indices, 
either two have a plus sign and the third has a negative sign (in which case our 
dependent index has a negative sign), or two have a negative sign and one has 
a positive sign (in which case our dependent index has a positive sign). Let us 
assume our dependent index has a negative sign, and consider the corresponding 
equation in the form of the case where the dependent index has a positive 

sign is handled similarly. The other three indices’ sum is in {2 — TV,..., 2TV — 1}. 
If the sum is in {2 — TV,..., 1} we may take Ck-i = —(TV — I); if the sum is in 
{!,..., TV} we may take Ck-i = 0; if the sum is in {TV,..., 2TV — 1} we may take 
Ck-i = TV — 1. In each case there is a valid choice of the dependent index. While 
if the other indices sum to 1 or TV then there are two choices of Ck-i, we shall see 
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in Lemma ITTm that this give lower order contributions and may be safely ignored 
as iV ^ oo. 

Now consider the indices in eq(A:) and eq(fc — 1); as long as at least one index 
has appeared only once in these two equations, we may continue the process. We 
choose any such index. It will be one of our dependent indices, and we label the 
unique other equation it occurs in as eq(fc — 2). 

We claim we may repeat this process until we have chosen one index from all 
but eq(fc) as a dependent index, and each equation has a dependent index which 
occurs for the first time in that equation. The only potential problem is there is 
an m > 1 such that, after we chose which equation to label eq(TO), every index in 
eq(m) through eq(A:) occurs exactly twice. If this were so, we would not be able to 
continue and choose a new dependent index and a new equation to be eq(m — 1). 
We show that there is no such m > 1. 

We prove this by contradiction. Assume not, so every index in eq(m) through 
eq(fc) occurs twice. In our initial configuration, we had 2k pairs of indices: ( 11 , 12 ), 
(* 2 , 13 ), • ■ ■, (* 2 fc,*i); note that each index is in exactly two pairs. Without loss of 
generality, assume eq(/c) has index ii. Our assumptions imply we have both ii’s, 
which means we have the pairs ( 11 , 12 ) and (i 2 k,ii)- Since we are assuming each 
index which occurs, occurs twice, we have the other 12 and the other i 2 k- Thus we 
have the pairs ( 12 , is) and (j 2 fe-i) * 2 fc)- Continuing in this manner, for m > 1 we see 
that if we were to terminate at some equation eq(m), then there would be at least 
two indices occurring only once. 

Therefore the process never breaks down. We may choose a labeling of the 
remaining k — 1 equations such that, in each equation, there is one and only one 
new dependent index occurring for the first time. The remaining fc + I indices are 
our free indices. □ 

We now have fc + 1 free indices, and fc — 1 dependent indices. There are 
choices for the fc + 1 free indices. We show that, except for Ok(N^) “bad” choices 
of indices, there are unique choices for the dependent indices and the Cj ’s such that 
all fc equations are satished, and all indices are in {I,..., IV}. As the contributions 
to M 2 k(N) are divided by the Ok(N^) bad indices contribute Ok(j^) to 

M 2 k{N), and the — Ok(N^) “good” indices contribute 1 + Ok{jf)- Thus the 
contribution to M 2 k from this matching is 1. 

Lemma 2.10. Except for Ok{N^) choices of the fc + 1 free indices, all the con¬ 
stants Cj (j € {I,..., fc — 1} j are determined uniquely in the set {0, ±(IV — 1)}, 
the dependent indices are uniquely determined in {I,..., A}, and the first fc — I 
equations are satisfied. 

Proof. By Lemma|^l each of the first fc—1 equations determines a single dependent 
index. Consider the sum of the other three indices in these equations. In proving 
Lemma El we showed that the Cj’s are unique whenever these sums are not I 
or N, and whenever a Cj was unique it lead to a unique choice of the dependent 
index in {1,..., A} such that the equation was satisfied. If the sum were either of 
these values, this would give us another equation, and a loss of at least one degree 
of freedom. This is immediate if one of the three indices is an independent index; 
if all are dependent indices, then we simply substitute for them with independent 
indices, and obtain an equation involving many indices, at least one of which is 
independent. Thus we again gain a relation among our independent indices. There 
are therefore Ok(N^) choices of the fc + 1 free indices such that the C}’s are not 
uniquely determined. □ 

Notice how in the previous lemma, the last coefficient Ck is not included. This 
is because in the above lemma we absolutely needed to be able to determine our 
dependent index (which occurred for the hrst time in eq(j)) with Cj. However, in 


REAL SYMMETRIC PALINDROMIC TOEPLITZ MATRICES AND CIRCULANT MATRICES 15 


the last equation, all the indices are determined. We therefore cannot determine 
Ck in quite the same way as we did for the other Cj’s. We now show that there is 
a valid choice of Ck for + Ok{N^) of the choices of the free indices. 

Theorem 2.11. For any of the (2k— 1)!! matchings of the 2k pairs of indices into 
k pairs, there are fc +1 free indices and k—1 dependent indices. For all but Ok{N^) 
choices of the free indices, every Cj is in {0, ±(N— 1)}, and is uniquely determined. 
Furthermore, we have = 0- Thus each matching contributes 1 + Ok(^) 

to M 2 k(N), or equivalently contributes 1 to M 2 k- Thus M 2 k = (2fc — 1)!!, the 2k^^ 
moment of the standard Gaussian density. 

Proof. We have proved much of Theorem imi in Lemmas m and mni What we 
must show now is that, for all but Ok{N^) “bad” choices of the free indices, the last 
equation is consistent. By our earlier results, we know all equations but possibly the 
last are satisfied, all dependent indices are in {1,..., TV}, and for all but the “bad” 
choices of indices, the Ci,..., Ck-i are uniquely determined and in {0, ±{N — 1)}. 

Consider now the last equation, eq(fc). From 12.2111 and the fact that all indices 
are in {1,..., A^j, we see that there is a choice of C/c G K. such that eq(fc) holds. 
We must show that Ck S {0, ±(TV — 1)}. 

We first note that Cfc G [2 — 2N, 2N — 2]. This is because each index is in [1, TV], 
and in two indices occur with a positive sign and two with a negative sign. 

We see that Ck is a multiple of — 1 by adding the k equations (eq(l) through 
eq(fc)). Each index occurs twice, once with a negative sign and once with a positive 
sign, and each Cj occurs once with a positive sign. Thus 

Ci + .-. + Cfc = 0; (2.22) 

see also Remark 12.71 As Ci,... ,Ck-i G {0,±(A^— 1)}, we obtain that Ck is a 
multiple of A^ — 1. As Ck G [—2(A^— 1),2(A^— 1)], we see that Ck G {0, ±(N — 1), 
±2(A^ — 1)}. We now show that Ck = ±2(A^ — 1) for at most Ok(N^) choices of 
the free indices. 

Consider the case when Ck = 2.N — 2; the other case is handled similarly. For 
this to be true, in eq(fc) the two indices with positive signs must equal N and the 
two indices with negative signs must be 1. If this happens, we impose relations on 
previous equations. Thus, just as in Lemma [2.1 HI we lose a degree of freedom, and 
there are only Ok(N^) choices of the free indices such that Ck = 2.N — 2. 

Therefore, Ck G {0,±(A^— 1)} and is determined uniquely (except for at most 
Ok(N^) choices), and all k equations are satisfied with indices in {1,..., A^j. □ 

This completes our proof that the limit of the average even moments, the M 2 fe’s, 
agree with the even moments of the standard Gaussian density. 

3. Convergence in Probability and Almost Sure Convergence 

Showing the limit of the average moments agree with the standard Gaussian’s 
moments is the first step in proving Theorem If ..II To complete the proof, we must 
show convergence in probability and almost sure convergence (both of which imply 
weak convergence). Fortunately, the arguments in |hM| are general enough to be 
immediately applicable for convergence in probability; a small amount of additional 
work is needed for almost sure convergence. We use the notation of ca and 
and state the minor changes needed to apply the results of |HM| to finish the proof. 
Note: in |HM| it is assumed that each bo = 0; by Lemma \J~^ we may assume bo = 0 
without changing the limiting spectral distribution of the ensemble. 

3.1. Convergence in Probability. Let A an infinite sequence of real numbers 
and let Ajv be the associated N x N real symmetric palindromic Toeplitz matrix. 
Let Xm-,N(A) be the random variable which equals the moment of the measure 
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associated to Aa? and let Mm be the moment of the standard Gaussian. Set 
XmiA) = Mm- We have Xm;N —> Xm in probability if for all e > 0 


lim Pn ({^ G On : \Xm-,N{A) — Xm{A)\ > e}) = 0. 

N—^oo 


(3.1) 


By Chebyshev’s inequality we have 

Pn ({^ G On : \Xm-,N{A) — Mm{N)\ > e}) < 


ElMmiA^n-ElMmiA^W 


(3.2) 


As Mm{N) — Mm —> 0 as iV ^ oo, it suffices to show for all m that 

lim {E[Mm{ANf]-E[Mm{AN)]'^) = 0 (3.3) 

N—*oo 

and then apply the Moment Convergence Theorem lTheorem ll.2ll . 

By ([TTHll we have 

i:[M„(yi„)>| = Y, 

l<2l 


. Jm<A' 

E[Mm{AiSf)] = jym+2 

^ (3-4) 

. Jm<A' 


There are two possibilities: if the absolute values of the differences from the i’s 
are not on equivalent diagonals with those of the j’s, then these contribute equally 
to E[Mm(AAr)^] and E[Mm(AAr)]^. We are left with estimating the difference for 
the crossover cases, when the value of an ia — ia+i = ^{j/3 — J/3+i)- The proof 
of the analogous result for the real symmetric Toeplitz ensembles in |hM) is done 
entirely by counting degrees of freedom, and showing that at least one degree of 
freedom is lost if there is a crossover. Such arguments are immediately applicable 
here, and yield the weak convergence. All that changes is our big-Oh constants; 
the important point to remember is that each Cj G {0, ±(iV — 1)}, which means 
there are at most 3^™ configurations where we apply the arguments of rmvTi . 


3.2. Almost Sure Convergence. Almost sure convergence follows from showing 
that for each non-negative integer m that 

Xm-,N{A) Xm(A) = Mm almost surely, (3.5) 

and then applying the Moment Convergence Theorem iTheorem ll . 211 . The key step 
in proving this is showing that 

lim E[\Mm{A^) -E[Mm{AN)]\^] = Om (■ (3.6) 

N^oo \I\ J 

The proof is completed by three steps. By the triangle inequality, 

\Mm{AN)-Mm\ < \Mm{AN) - Mm{N)\ + \Mm{N) - Mm\- (3.7) 

As the second term tends to zero, it suffices to show the first tends to zero for 
almost all A. 

Chebychev’s inequality states that, for any random variable X with mean zero 
and finite moment, 

Prob(|A| > e) < (3.8) 
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Note 'E[Mm{AN) — Mm{N)] = 0, and following [HMj one can show the fourth 
moment of Mm{A]\f) — Mm{N) is Om (]^); we will discuss this step in greater 
detail below. Then Chebychev’s inequality (with £ = 4) yields 


i;NiA) — > e) < 




< 


C„ 


(3.9) 


iV2e4- 

The proof of almost sure convergence is completed by applying the Borel-Cantelli 
Lemma and proving lliCT : we sketch the proof below. 

We assume p is even for convenience (though see Remark 6.17 of [hM)l a 
careful reading of the proofs in §6 of [HM| show that analogues of most of the results 
hold in the palindromic case as well, as most of the proofs are simple calculations 
based on the number of degrees of freedom. The only theorems where some care 
is required are Theorems 6.15 (see equation (50)) and 6.16 (see equation (51)). In 
those two theorems, more than just degree of freedom arguments are used; however, 
the same equations are true for each of our configurations, and thus analogues of 
these results hold in the palindromic case as well, completing the proof of almost 
sure convergence. 


4. Connection to Circulant and Other Ensembles 

We show how our analysis of the Diophantine equations associated to the en¬ 
semble of real symmetric palindromic Toeplitz matrices may be used to study the 
ensembles related to circulant matrices investigated by Bose and Mitra, as well as 
other ensembles (for example, real symmetric palindromic Hankel matrices). We 
conclude by showing the two methods combine nicely to yield an almost sure Cen¬ 
tral Limit Theorem. 

We first state two needed results. 

Lemma 4.1 (Cauchy’s interlacing property). Let A^ be an N x N real symmetric 
matrix and B be the (TV— 1) x (N — 1) principal sub-matrix of A^- If Xn > • ■ ■ > Ai 
(respectively, A)y_i > > X'l) are the eigenvalues of Am (respectively, Bm), then 

Xn > X'm-i > Xn-1 > X'm-2 > ••• > A 2 > X\ > Ai. (4.1) 

For a proof, see [DHj . For us, the important consequence of the Cauchy inter¬ 
lacing property is the Rank Inequality (Lemma 2.2 of |Baip : 

Lemma 4.2 (Rank Inequality). Let Am and Bm be N x N Hermitian matrices. 
Then 

snp\F^^ {x) - {x)\ < ^^’4k(A^-Rjv) _ 

kgr Xf 

To prove the equivalence of our methods with the direct analysis of explicit 
formulas for eigenvalues, all we need is a simple consequence of the Rank Inequality: 

Lemma 4.3 (Special Case of the Rank Inequality). Let Am be an N x N real 
symmetric matrix with principal (N — I) x (N — 1) sub-matrix Bm-i- Then 

sup \ F ^^{ x )- F ^^-^{ x )\ < 4. (4.3) 

a;GR XJ 

Proof. We may extend Bm-i to he an N x N real symmetric matrix Bm by setting 
all entries of Bm in either the row or the column (but not both) equal to 
zero, and the entry bMN to any number we wish. We may now apply Lemma, l4.2l to 
Am and Bm (with rank(Ajv — Bm) < 2), and then note that the spectral measures 
of Bm -1 and Bm are close. □ 

We frequently use the rank inequality to show that two N x N matrices with 
common (N — I) x (N — 1) principal sub-matrix have empirical spectral measures 
differing by negligible amounts (as N —>■ 00 ). 





18 


ADAM MASSEY, STEVEN J. MILLER, AND JOHN SINSHEIMER 


4.1. Circulant Ensembles. Bose and Mitra (see page 9 of [BMp study what they 
call symmetric Toeplitz matrices; their matrices are of the form 


Sn = N-^/^ 


Xo 

Xi 

X2 

XN-2 

XN-1 

Xi 

Xo 

Xi 

XN-3 

XN-2 

XN-1 

XN-2 

XN-3 ■ ■ 

Xi 

Xo 


= Xi 


They prove the limiting spectral distribution exists for this ensemble, and is the 
standard Gaussian. Their proof starts with explicit formulas for the eigenvalues of 
the matrices in terms of the matrix entries. The rest of the argument is similar 
to their analysis of the empirical eigenvalue distribution of circulant matrices. The 
normality of the limiting spectral distribution (i.e., it being the standard Gaussian) 
follows from a detailed analysis of the eigenvalues, and requires several explicit 
computations. In our analysis, the normality is a consequence of each matching 
contributing fully, and allows us to avoid having to compute detailed properties of 
the eigenvalues of the ensemble. 


Theorem 4.4. The ensembles of real symmetric palindromic Toeplitz matrices (see 
and symmetric Toeplitz matrices (see llOIl l have the same limiting spectral 
distribution when the independent entries are chosen from a distribution with mean 
0, variance 1 and finite higher moments. 


Proof. The ensemble of 2N x 2N real symmetric palindromic Toeplitz matrices (see 
(11.211 1 is almost, but not quite, the same as the (2iV — 1) x {2N — 1) symmetric 
Toeplitz matrices studied by Bose and Mitra. The difference between the two 
is that the symmetric Toeplitz matrices are {2N — 1) x {2N — 1) principal sub¬ 
matrices of the 2N X 2N palindromic Toeplitz matrices. By the rank inequality, as 
N oo the normalized limiting spectral distributions converge to a common value; 
similar arguments relate {2N — 1) x (27V — 1) palindromic Toeplitz and 27V x 27V 
symmetric Toeplitz ensembles. Thus solving either ensemble is equivalent to solving 
the other. □ 


In the ensemble of real symmetric Toeplitz matrices investigated in |HM| . the 
authors assumed bo = 0, as all bo does is shift each normalized eigenvalue by 
bo / -v/TV; this will not affect the limiting spectral distribution. Though the palin¬ 
dromic Toeplitz matrices have &o’s off the main diagonal, the following lemma shows 
that we may again take bo = 0 without affecting the limiting spectral distribution. 

Lemma 4.5. The limiting spectral distribution of the ensemble of real symmetric 
palindromic Toeplitz matrices, with the bi i.i.d.r.v from a probability distribution 
with mean 0, variance 1 and finite higher moments, is unchanged if we additionally 
require bo to equal zero. 

Proof. If bo only occurred on the main diagonal, then its only effect would be to 
shift each normalized eigenvalue by bo/\/N, which is negligible in the limit. The 
argument thus reduces to showing that the two other occurrences of bo (in the upper 
right and lower left corners of our palindromic Toeplitz matrices) have negligible 
effect on the distribution of the normalized eigenvalues. 

The proof follows by multiple applications of the rank inequality iLemma I4.dll . 
Given an TV x TV real symmetric palindromic Toeplitz matrix as in (11.211 . let 
A'j^ be the matrix with entries qT = Oij, except for oin = ajvi = 0, and let B^-i 
be the (TV — 1) x (TV — 1) principal sub-matrix common to both A^ and A'j.^. Thus 
the normalized empirical spectral measures of Ajv and are both within 4/TV of 
that of Bm- 1 , and therefore differ from each other by at most 8/TV. Let now A'fq be 
the same matrix as A'j^ except with the main diagonal entries replaced by 0. The 
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normalized eigenvalues (recall we divide by VN) of and A'l^ differ by ho/VN. 
Therefore 

F^'n/AN(^x) = . (4.5) 

Thus the normalized empirical spectral distributions for Aj\i, A'j^ and A'^ all differ 
by a negligible amount as N oo, so the respective limiting spectral distributions 
of these three ensembles converge to the same distribution. □ 


Remark 4.6. For i G {1,2}, assume fi{N) = o{N). Arguing as in the proof 
of Lemma EH we see our results immediately extend to the limit of ensembles of 
N X N real symmetric matrices where the upper left and lower right blocks may 
be of size fi{N) and f 2 {N), and the main diagonal block (of size (N — fi{N) — 
/ 2 (fV)) X (TV — fi{N) — f 2 iN))) is real symmetric, palindromic and Toeplitz. We 
use Lemma \4.‘A to compare the spectral measure of the N x N matrix with that of 
the related N x N matrix where we have set all entries in the first fi{N) rows and 
columns, and the last / 2 (fV) rows and columns equal to zero. 


4.2. Hankel Matrices. Our results hold for a wider class of matrices. Recall a 
Hankel matrix is of the form 


( bN-\ 

bN-2 

bN-3 ■ 

62 

61 

bo 

\ 

bN-2 

bN-3 

bN-4 ■ 

61 

bo 

b-1 


bN-3 

bN-4 

bN-5 ■ 

bo 

b-1 

6-2 


b2 

bi 

bo 

b-N+5 

b-N+4 

b-N+3 


bi 

bo 

b-1 

b-N+4 

b-N+3 

b-N+2 


V bo 

b-i 

6-2 

b-N+3 

b-N+2 

b-N+1 

/ 


Let Jn be the N x N matrix which is zero everywhere except on the anti-main 

diagonal, where the entries are 1. For example, J 2 = ^ ^ q Note = In 

(where In is the N x N identity matrix), and HnJn is a Toeplitz matrix. If 
additionally bk = 5-fe then HnJn is a real symmetric Toeplitz matrix, and finally 
if the first row of Hn is a palindrome then HnJn is a real symmetric palindromic 
Toeplitz matrix. We shall call such Hn (where bk = 5-fc and the first row is a 
palindrome) real symmetric palindromic Hankel matrices. 

There is a one-to-one correspondence between real symmetric palindromic Toeplitz 
and Hankel matrices. A simple calculation shows that if {Hn,Tn) is such a pair, 
then 

HnJn = JnHn = Tn- (4.7) 

In particular, this implies 

H% = HnInHn = HnJnJnHn = T", (4.8) 


and hence by induction we have that 


TLT^k _ rp2k 

■ 


(4.9) 


To show the spectral measures attached to eigenvalues of real symmetric palin¬ 
dromic Toeplitz matrices converge to the standard Gaussian, all we needed was 
(I1.15|l . There we saw the calculation depends solely on the trace of the even powers 
of our matrices. As there is a one-to-one correspondence. Theorem 11.51 holds for 
real symmetric palindromic Hankel matrices as well. 
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5. An ALMOST SURE CENTRAL LIMIT THEOREM 


We discuss how our results, combined with those of Bose and Mitra, yield a 
version of the almost sure Central Limit Theorem. See |BC| (and the numerous 
references therein) for more details as well as several examples of such theorems. 
We are grateful to the referee for pointing out this application of our results. 

Bose and Mitra analyze the distribution of the eigenvalues of the symmetric 
Toeplitz ensemble (see H4.1II by using the explicit formulas for the eigenvalues. For 
these N X N circulant matrices with entries in {xq, ... ,XAr_i} (with x^-j = Xj), 
if N is odd then the eigenvalues are 


Afe 


1 

7^ 


N-l 

xi cos{2ttM/N) 


— ^N-k] 


(5.1) 


a similar formula holds if N is even (there the eigenvalue \n /2 have multiplicity 
one). We have shown that the limiting distribution of eigenvalues of this symmetric 
Toeplitz ensemble is the same as that of our palindromic Toeplitz ensembles. The 
importance of this connection is that we have shown the convergence is almost sure 
for the palindromic ensemble. Thus we may translate this almost sure convergence 
to a statement about the eigenvalues A^, which are weighted sums of the symmetric 
Toeplitz matrix entries. We thus obtain 


Theorem 5.1. For each N let Xq, ..., Ajv-i be independent, identically distributed 
random variables (subject to the condition that Xj = Xjq-j) from a distribution p 
with mean 0, variance 1, and finite higher moments. For w = (xq, xi,...) set 
Xi{uj) = xi, and consider the probability space (11, P) (where P is induced from 
Prob(A^((v) < x) = J((^p(t)dt). Set 


N-l 


sPiiv) = -=^Xi{uj)cos{2M/N). 
e=o 


Then 


a; S n : sup 

a;GR 


N-l 


k=0 




— <i>(x) 


0 as n ^ oo 


= 1 ; 


(5.2) 


(5.3) 


here I denotes the indicator function and $ is the distribution function of the stan¬ 
dard normal: 


$(x) = 




\/27r j-c 


dt 


(5.4) 


A more useful version of the above is to note that we have double counted all 
the eigenvalues (except possibly one, which will not affect anything in the limit). 
Letting N = 2n and looking at only half the eigenvalues, we immediately obtain^ 
Theorem 11.61 from Theorem ED 


6. Future Work 

So far we have investigated the density of the eigenvalues; we now consider 
another problem, that of the spacings between adjacent eigenvalues. Note the 
palindromic condition means that 0 is always an eigenvalue (because the first and 
last rows are identical), though as N oo, the contribution of one eigenvalue 
becomes negligible. 

As there are only {N — 2) f 2 degrees of freedom for the ensemble of real sym¬ 
metric palindromic Toeplitz matrices, which is much smaller than N(N l)/2, 

^The proof uses the fact that which follows from our normalizations, simple 

algebra, and the fact that Xj = Xjv—j ■ 
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it is reasonable to believe the spacings between adjacent normalized eigenvalues 
^ may differ from those of full real symmetric matrices. The en¬ 
semble of all real symmetric matrices is conjectured to have normalized spacings 
given by the GOE distribution (which is well approximated by Axe~^^ ) whenever 
the independent matrix elements are independently chosen from a nice distribution 
p. Studying thin sub-ensembles opens up the possibility of seeing different behavior. 

Interestingly (see I.TMRRj among others), the spacings between adjacent nor¬ 
malized eigenvalues of d-regular graphs appear to be given by the GOE as well. 
Thus, while the density of eigenvalues of d-regular graphs is different than those of 
all real symmetric matrices (Kesten’s measure versus the semi-circle), the adjacent 
normalized differences between eigenvalues behave like differences of full real sym¬ 
metric matrices. In the opposite extreme, consider band matrices of width 1 (i.e., 
diagonal matrices). There the spacing between adjacent normalized eigenvalues is 
Poissonian (e“^), and the density of normalized eigenvalues is whatever distribution 
the entries are drawn form. 

We chose 40 Toeplitz matrices (1000 x 1000) with entries i.i.d.r.v. from the 
standard normal. The palindromic condition implies that 0 is always an eigenvalue 
of a real symmetric palindromic Toeplitz matrix. To minimize the effect of this 
forced eigenvalue, instead of looking at the middle 11 normalized eigenvalues of 
each matrix, we looked at the next set of 11 eigenvalues. This gave us 10 differences 
between adjacent normalized eigenvalues, and we compared those to the standard 
exponential; if the spacings are Poissonian, the standard exponential should be a 
good fit. Similar results were obtained for larger shifts. See Figures |2 and 0 for the 
plots. 



Figure 2. Differences between normalized eigenvalues 506 
through 516 of 40 real symmetric palindromic 1000 x 1000 Toeplitz 
matrices. 



Figure 3. Gomparison of differences between normalized eigen¬ 
values 506 through 516 of 40 real symmetric palindromic 1000 x 
1000 Toeplitz matrices and the standard exponential; the small 
differences have been suppressed. 

The distribution of differences looks approximately Poissonian; definitely more 
Poissonian than GOE or GUE (both of which have small probabilities of small 














22 


ADAM MASSEY, STEVEN J. MILLER, AND JOHN SINSHEIMER 


spacings). While the fit to Poissonian behavior is not as good as the real symmetric 
Toeplitz matrices investigated in [HMj . it is not unreasonable to conjecture that in 
the limit as ^ oo, the local spacings between adjacent normalized eigenvalues 
will be Poissonian. 
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